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Abstract. We investigate the probability distribution of the return intervals r between successive 1-min volatilities 
of two Chinese indices exceeding a certain threshold q. The Kolmogorov-Smirnov (KS) tests show that the two in- 
dices exhibit multiscaling behavior in the distribution of r, which follows a stretched exponential form fq(T/{r)) ~ 

e - a (T/<T>)"' w j m different correlation exponent 7 for different threshold q, where (t) is the mean return interval cor- 
responding to a certain value of q. An extended self-similarity analysis of the moments provides further evidence of 
multiscaling in the return intervals. 

PACS. 89.65.Gh Economics; econophysics, financial markets, business and management - 89. 75. Da Systems obeying 
scaling laws - 05.45. Tp Time series analysis 



1 Introduction 

The analysis of the waiting time between two successive events 
is helpful to understand the dynamics of stock markets, which 
has drawn much attention. A variety of waiting time variables 
have been raised by different definitions of event to characterize 
the stock markets from different view angles, such as the per- 
sistence probability HHUO, the exit time 1U|5][6][7M]H), and the 
intertrade duration [ 10J. TTIfT2lfT3l . Recently the return intervals 
between successive extreme events exceeding a certain thresh- 
old q have been investigated for numerous complex systems, 
including rainfalls, floods, temperatures and earthquakes 031 
[T5lfT6lfr7lfT8ll . Similar analysis was subsequently carried out 
concerning the volatility return intervals, which are defined as 
the waiting times between successive volatilities exceeding a 
certain threshold in stock markets. 

Yamasaki et al. and Wang et al. used the daily data and in- 
traday data of US stocks to study the properties of the volatility 
return intervals [19 20 2 1 22 1 . They found that the distribution 
of return intervals r between successive volatilities greater than 
a certain threshold q showed scaling behavior. This scaling be- 
havior is expected to be of great importance for the risk assess- 
ment of large price fluctuations. Similar scaling behavior was 
observed in the return intervals of daily and 1 -min volatilities of 
thousands Japanese stocks 11231 . Qiu, Guo and Chen analyzed 
the high-frequency intraday data of four liquid stocks traded in 
the emerging Chinese market, and found that the return interval 
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distributions of the Chinese stocks investigated also followed a 
scaling behavior l24l . 

In contrast, Lee et al. investigated the return intervals of 1- 
min volatility data of the Korean KOSPI index ll25l and no scal- 
ing was observed. Wang et al. used the Trade & Quate Database 
of the 500 constituent stocks composing the S&P 500 Index 
and found a multiscaling behavior in the volatility return inter- 
vals i26l . A systematic deviation from scaling was observed 
in the cumulative distribution of return intervals, which im- 
plies that its probability distribution also deviates from scal- 
ing. Moreover, the m-th moment of the scaled return intervals 
showed a certain trend with the mean interval, which supports 
the finding that the return intervals exhibit multiscaling behav- 
ior. This finding was reinforced by further analysis of 1 137 US 
common stocks ll27l . Ren, Guo and Zhou used a nice high- 
frequency database j28 | to study the interval returns of 30 most 
liquid stocks in Chinese stock market |j29l . The Kolmogorov- 
Smirnov (KS) test was adopted to examine the possible col- 
lapse of the interval distributions for different threshold val- 
ues. Only 12 individual stocks passed the KS test and showed a 
scaling behavior, while the remaining 18 stocks exhibited mul- 
tiscaling behavior. 

In this paper, we study the distribution of the volatility re- 
turn intervals of two Chinese stock indices, i.e., Shanghai Stock 
Exchange Composite Index (SSEC) and Shenzhen Stock Ex- 
change Composite Index (SZCI). According to the KS test and 
the extended self-similarity (ESS) analysis of the moments, we 
find that the return intervals of the two indices exhibit multi- 
scaling behavior, consistent with the multiscaling behavior of 
some individual stocks which partially compose the indices. 
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The paper is organized as follows. In Section [2] we explain 
the database analyzed and how the volatility return intervals 
are calculated. Section[3]examines the scaling behavior and the 
curve fitting of the return interval distributions using the KS 
tests. In Section 21 we further study the multiscaling behavior 
by analyzing the moments of the scaled return intervals. Sec- 
tion [5] concludes. 



where (r) is the mean return interval that depends on the thresh- 
old q. If the function f q (x) is independent of q, there exists 
a universal function f{x) such that f q (x) = f(x) for differ- 
ent values of q. In other words, the probability distributions 
/ g (r/(r}) of the scaled return intervals collapse onto a single 
curve /(t/ (t)) and the return intervals exhibit scaling behav- 
ior. 



2 Preprocessing the data sets 

Our analysis is based on the high-frequency intraday data of 
two Chinese indices, the Shanghai Stock Exchange Composite 
Index (SSEC) and the Shenzhen Stock Exchange Composite 
Index (SZCI). Each composite index is constructed based on 
all the stocks listed on the corresponding exchange. The indices 
are recorded every six to eight seconds from January 2004 to 
June 2006. We define the volatility as the magnitude of loga- 
rithmic index return between two consecutive minutes, that is 
R(t) = I In Y(t)— In Y(t-1)\, where the index Y is the closest 
tick to a minute mark. Thus the sampling time is one minute, 
and the volatility data size is about 140,000. 

The intraday volatilities of both indices exhibit a L-shaped 
intraday pattern l30l . similar to the individual stocks B0II29I . 
When dealing with intraday data, this pattern should be re- 
moved [20,21 22 26 24]. Otherwise, the return intervals dis- 
tribution will exhibit daily periodicity for large thresholds. The 
intraday pattern A(s) is defined as 



A(s) 



1 N 



(i) 



which is the volatility at a specific moment s of the trading day 
averaged over all N trading days and R l (s) is the volatility at 
time s of day i. The intraday pattern is removed as follows 



R'(t) 



A( s y 



(2) 



Then we normalize the volatility by dividing its standard devi- 
ation 

(t) = ____ Hi 

V( ' {(R'(t) 2 ) - {R'{t)) 2 }V 2 ' <• ' 



3 Probability distribution of return intervals 

3.1 Probability distribution of scaled return intervals 

We study the return intervals r between successive volatilities 
exceeding a certain threshold q. A series of return intervals are 
obtained for each particular threshold q and its number de- 
creases with increasing threshold q. For each value of q, we 
can obtain empirically a probability distribution P q (r) of the 
volatility return intervals, which is related to the probability 
distribution / 9 (r/(r)) of the scaled return intervals r/(r) as 
follows 

P,(r) = i^f q (r/(r)), (4) 



To investigate whether the return interval distributions of 
the two Chinese indices exhibit scaling behavior, we plot in 
Figure Q] the empirical probability distributions /q(T/(r)) = 
P 9 (t)(t) as a function of the scaled return intervals t/ (r) for 
a wide range of threshold q = 2, 3, 4, 5. It is evident that the 
curves for different thresholds q show systematic deviations 
from each other and do not collapse onto a single curve, es- 
pecially for the Shanghai Composite Index. This indicates that 
the distributions of return intervals for both indices could not 
be approximated by a scaling relation. With the increase of the 
threshold q, there are more large scaled return intervals and the 
distribution becomes broader. 
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Fig. 1. (Color online) Empirical probability distributions of scaled 
return intervals for different threshold q = 2, 3, 4, 5 for the Shanghai 
Composite Index and the Shenzhen Composite Index. The solid curves 
are the fitted functions ce~ ax with the parameters listed in Table[2] 
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The observation that there is no scaling behavior in the 
volatility return interval distributions is consistent with the re- 
sults of a previous study of individual Chinese stocks 1291 , The 
Kolmogorov-Smirnov test shows that only 12 stocks out of 30 
most liquid Chinese stocks exhibit scaling behaviors in the re- 
turn interval distributions for different thresholds q, while the 
other 18 stocks do not show scaling behavior l29l . 



3.2 Kolmogorov-Smirnov test of scaling in return 
interval distributions 

The eyeballing of the probability distributions offers a qualita- 
tive way of distinguishing scaling and nonscaling behaviors. 
Here we further adopt a quantitative approach based on the 
Kolmogorov-Smirnov test. The standard KS test is designed to 
test the hypothesis that the distribution of the empirical data is 
equal to a particular distribution by comparing their cumulative 
distribution functions (CDFs). Our hypothesis is that the two 
return interval distributions for any two different q values do 
not differ at least in the common region of the scaled return in- 
tervals 11231 . Suppose that F qi is the CDF of return intervals for 
qi and F q . is the CDF of return intervals for qj, where q. L ^ qj. 
We calculate the KS statistic by comparing the two CDFs in 
the overlapping region: 

KS = m ax (\F qi -F qj \), qi ^ qj . (5) 

When the KS statistic is less than a critical value CV, the hy- 
pothesis is accepted and we can assume that the distribution for 
qi is coincident with the distribution for qj . The critical value 
at the significance level of 5% is CV = 1.36/ ^Jmnj (m + n), 
where m and n are the numbers of interval samples for qi and 
qj, respectively. 

In Table [T]is depicted the KS statistics and the correspond- 
ing critical values for the two indices. For the Shanghai Com- 
posite Index, KS > CV for all (<& , qj ) pairs except (<& ,qj) = 
(4, 5). It means that the distribution for q — 5 coincides with 
the distribution for q = 4, but significantly differs from the dis- 
tributions for other q values. Similar phenomenon is observed 
for the Shenzhen Composite Index. Therefore, we can conclude 
that the distributions differs for different q and do not collapse 
onto a single curve. The KS test confirms the result that the 
return interval distributions do not exhibit scaling behavior. 



3.3 Fitting the return interval distributions 

For those stock markets showing scaling behavior in the volatil- 
ity return interval distributions, it is a consensus that the scaling 
form could be approximated by a stretched exponential func- 
tion ll20Tl2TTl23ll^ll26ll27l . 

f q (x) = f(x) = ce- ax \ (6) 

where c and a are two parameters and 7 is the correlation expo- 
nent characterizing the long-term memory of volatilities. There 
is nevertheless exceptions. Based on the KS test and the weighted 
KS test, Ren, Guo and Zhou showed that the scaled return in- 
terval distributions of 6 stocks (out of the 12 stocks exhibiting 
scaling behavior) can be nicely fitted by a stretched exponential 
function with 7 « 0.31 at the significance level of 5% 11291 . 

In this work, we have demonstrated that the return interval 
distributions of the two Chinese indices do not follow a scal- 
ing form. It is still interesting to check if the (scaled) return 
intervals follow a stretched exponential distribution expressed 
in Eq. © but with different values of parameters c, a and the 
correlation exponent 7 for different threshold q. In this case, 
our hypothesis is that the empirical distribution is coincident 
with its best fitted stretched exponential function. Similar to 
the KS test we have conducted for two empirical samples, we 
use the KS statistics to test whether the distribution for a cer- 
tain threshold q is identical to its best fitted distribution in the 
overlapping region of the scaled return intervals. Let F q be the 
cumulative distribution for q and F$e the cumulative distribu- 
tion from integrating the fitted stretched exponential. The KS 
statistic defined in Eq. (0 becomes 

KS = max{\F q -F SE \), qe {2,3,4,5}. (7) 

Then the bootstrapping approach is adopted 03111321 . To do 
this, we first generate 1000 synthetic samples from the best fit- 
ted distribution and then reconstruct the cumulative distribution 
-Fsim of each simulated sample and its CDF F s i m sE from inte- 
grating the fitted stretched exponential. We calculate the values 
of KS between the fitted CDF and the simulated CDF using 

KS sim = max (\F sim - -F sim .SE|) ■ (8) 

The p- value is determined by the frequency that KS sun > KS. 
The tests are carried out for the two Chinese indices. The pa- 
rameters of the fitted stretched exponential and resultant p- 
values for different q are depicted in Table [2] 



Table 1. The Kolmogorov-Smirnov test of return interval distribu- 
tions by comparing the statistic KS with the critical value CV. 



Table 2. The Kolmogorov-Smirnov test of return interval distribu- 
tions by comparing empirical data with the best fitted distribution and 
synthetic data with the best fitted distribution. 



Shanghai Composite Index Shenzhen Composite Index 



Qi 


Qj 


KS 


CV 


Qi 


Qj 


KS 


CV 


2 


3 


0.0363 


0.0201 


2 


3 


0.0321 


0.0211 


2 


4 


0.0720 


0.0296 


2 


4 


0.0401 


0.0294 


2 


5 


0.1170 


0.0434 


2 


5 


0.0535 


0.0408 


3 


4 


0.0436 


0.0329 


3 


4 


0.0244 


0.0326 


3 


5 


0.0870 


0.0456 


3 


5 


0.0444 


0.0432 


4 


5 


0.0445 


0.0506 


4 


5 


0.0192 


0.0478 





Shanghai Composite Index 


Shenzhen Composite Index 


Q 


c 


a 


7 


P 


c 


a 


7 P 


2 


0.80 


2.05 


0.59 


0.80 


0.67 


3.51 


0.47 0.02 


3 


2.13 


14.20 


0.38 


0.63 


1.55 


14.85 


0.37 0.54 


4 


0.92 


5.79 


0.43 


0.58 


1.71 


22.01 


0.34 0.72 


5 


1.05 


14.19 


0.35 


0.74 


1.56 


25.06 


0.33 0.85 



The p value could be regarded as the probability that the 
empirical distribution consists with its best fit. Consider the 
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significance level of 1%. If the p- value of an index for a cer- 
tain threshold q is less than 1 %, then the null hypothesis that the 
empirical PDF of this index can be well fitted by a stretched ex- 
ponential is rejected. According to Table[2] the null hypotheses 
for all the q values are accepted for both two Chinese indices. It 
is noteworthy to point out that the p-values for all the q values 
(except for q — 2 for the Shenzhen Composite Index) are very 
large, implying high goodness-of-fit of the stretched exponen- 
tial to the empirical PDFs. At the significance level of 5%, the 
stretched exponential is rejected when q = 2 for the Shenzhen 
Composite Index. To show how good the stretched exponen- 
tial fits the data, we illustrate in Figure Q] the fitted stretched 
exponential with the parameters listed in Table |2] It is obvi- 
ous that the empirical PDFs could be well fitted by a stretched 
exponential. In principle, the stretched exponential fits the em- 
pirical PDF better when the p-value is larger. For instance, the 
stretched exponential fits the empirical PDF for the Shanghai 
Composite Index better than the Shenzhen Composite Index 
when q = 2. 

According to Table [2] the parameters differ from one an- 
other, providing further evidence supporting our conclusion that 
the return interval distributions do not have a scaling form. On 
average, the exponent 7 decreases with increasing threshold q, 
which is in line with the US stocks l27ll . 



4 Moments of scaled return intervals 

The distributions of return intervals exhibit multiscaling behav- 
ior and show a systematic tendency with the threshold q. To fur- 
ther study this tendency of the interval distribution with q, we 
compute the moments of the scaled return intervals x = rj (r) 
defined as 



of (r) when m > 1. These two types of moment functions are 
delimited by the horizontal line fix = 1. 



Mm = <(r/(r>r) 



1/m _ 



x m f q {x)dx 



1/m 



(9) 



where the mean interval (r) is dependent of the threshold q. 
When m = 1, we have fix = 1 by definition, independent of 
q. If there is a scaling behavior that f q (x) = f(x), the m-th 
moment fj, m is a univariate function of the order m and is in- 
dependent of any other variables including the threshold q and 
the mean return interval (r). On the contrary, the m-th moment 
/x m is not constant with respect to (r) for m 7^ 1, when there 
is no scaling in the return interval distributions. 



4.1 Dependence of moment on mean return interval 

We first investigate the relation between fi m and (r). To better 
quantify the dependence of p, m on (r), we calculate the mo- 
ments in a certain medium range of (r) to avoid the finite size 
effect and discreteness effect 0261 . Figure [2] illustrates the mo- 
ments [i m for m = 0.25,0.5, 1.5,2.0 versus (r) for the two 
Chinese indices. We investigate \i m for a range of (r) corre- 
sponding to 1 < q < 5. Each curve of the moments fj, m sig- 
nificantly deviate from a horizontal line, confirming the multi- 
scaling behavior in the return interval distributions. The mo- 
ment function (j, m decreases with the increase of (r) when 
m < 1, and shows an increasing tendency with the increase 




Fig. 2. (Color online) Moment /i m vs (r) for the two Chinese indices: 
Shanghai Composite Index and Shenzhen Composite Index. For each 
data set, four moments m = 0.25, 0.5, 1.5, 2.0 are presented. 

Take a careful look at /Lt m in Figure 12 for in < 1 (to > 
1) fj, m first decreases (increases) rapidly with the increase of 
(r), and then starts to decrease (increase) relatively slowly at 
(t) = 10. The discreteness of the records of t responds to the 
rapid increase (decrease) of p m for small (r) ((r) < 10). The 
moment for extremely large (r), i.e., (r) > 100 ((r) is in units 
of standard deviations), will increase (decrease) for m < 1 
(to > 1) due to the finite size effect. We choose to study /i m in 
a medium region 10 < (r) < 100 where in the effects of finite 
size and discreteness are small and /i m shows a clear power- 
law-like trend with (r). We use a power law to fit the moment 
in this medium region, 



(r) Q - 



(10) 



If the PDF of return intervals follow a scaling form, /i m is inde- 
pendent of (r) according to Eq. (0 and the exponent a should 
be some value very close to 0. If the exponent a is significantly 
different from 0, it implies that the PDF of return intervals may 
show multiscaling behavior. 
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Figure [3] plots the exponent a as a function of order m for 
the two Chinese indices. This figure shows that the exponent 
a differs from in a systematic fashion. The exponents for 
the two indices are very close to each other when m is small. 
For m < 1, a is negative. The exponent a increases with m 
when m < 3 and decreases afterwards owning to the finite size 
effect. For large order m, the a value for the Shenzhen Com- 
posite Index is greater than that for the Shanghai Composite 
Index. This implies that large (r) tends to occur with greater 
probability for the Shenzhen Composite Index than the Shang- 
hai Composite Index, since large (r) contributes more for high 
order \i m . 



In other words, (i m . n is independent of (r™) 1 /". 

Our empirical test focuses on the case that n = 1. This ESS 
framework was also used to investigate scaling in the exit times 
in turbulence 041 and intertrade durations [35). According to 
Eq. (O 

a(m) = £(m, l)/m — 1, 



Since £(1, 1) = 1, we have a(l) = when m 
verified by Figure [3] 



4.2 Dependence of moment on order m 



(18) 
1. This is well 
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Fig. 3. (Color online) Exponent a of fj, m in the region 10 < (r) < 
100 for the two Chinese indices: Shanghai Composite Index and Shen- 
zhen Composite Index presented by circles and squares respectively. 

The above analysis according to Eq. ( fTUb can be related to 
the extended self-similarity (ESS) analysis 1331 . which reads 



If the generalized variable of [i m 



/ T n\l/n 



»H 1 / m / T m\l/m 



(ID 



(12) 



scales as 



^n~l(r n ) 1/n ) (13) 



together with Eq. (fTTb . we have 

(a + l)/n = £(rn, n)/m. (14) 
If the return interval distribution can be scaled as follows 



we obtain that 



In this case, we have 



£(m, n) = m/n. 



a = 0. 



(15) 

(16) 
(17) 



The moment fi m not only display a significant dependence on 
(t), but also shows a systematic tendency with m as shown in 
Figure|2] It is interesting to investigate the relation between /z m 
and m directly. For a fixed (r), one can study the moment of 
t of various orders m. If the return interval distribution strictly 
obeys a scaling form, fj, m should not depend on (r), and the /z m 
curves for different (r) should all collapse onto a single curve. 
We plot in Figure |4ja) the moment fj, m as a function of m for 
fixed (r) = 10, 30, 100 corresponding to q = 2.0, 3.2, 4.8 for 
the Shenzhen Composite Index. One sees the curves for dif- 
ferent (r) exhibit substantial deviations from a single curve, 
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Fig. 4. (Color online) (a) Moment (i m vs m for Shenzhen Composite 
Index for (r) = 10, 30, 100 corresponding to q = 2.0, 3.2, 4.8, pre- 
sented by circles, squares, triangles respectively, (b) Analytical mo- 
ments obtained from the stretched exponential distributions with pa- 
rameters fitted from empirical data of Shenzhen Composite Index for 
9 = 2,3,5. 
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which demonstrates the multiscaling behavior of return inter- 
vals. For small m (m < 1), fj, m decreases with the increase of 
(t), while for big m (m > 1) \i m tends to increase with the 
increase of (r). This is not difficult to understand since small r 
dominates fi m for small order m and large r dominates \i m for 
large order m. The situation for the Shanghai Composite Index 
is very similar. 

We have demonstrated that the return interval distribution 
follows a stretched exponential form with different parameters 
for various thresholds q in Section 13.31 For the data perfectly 
follow a stretched exponential distribution, we can calculate 
the analytical result of the moment fi m by substituting Eq. © 
to Eq. (O and considering the normalization condition of prob- 
ability density. It follows immediately that ll26l 



r((m + i)/ 7 ) 



-i 1/m 



(19) 



In Figure Hfb), the analytical curves of /x m versus m for three 
stretched exponential distributions fitted from empirical data of 
Shenzhen Composite Index for q = 2, 3, 5 are plotted. As one 
can see that the analytical results are similar to that of the em- 
pirical data, which supports the multiscaling of the empirical 
return intervals. 



5 Summary and conclusions 

We have studied the multiscaling properties of the distributions 
of volatility return intervals for two Chinese indices, together 
with their moments. The Kolmogorov-Smirnov test is adopted 
to examine the scaling behavior of the return interval distribu- 
tions as well as the particular form of the distribution. We find 
that the return intervals of the two indices exhibit multiscal- 
ing behaviors, and their distributions for different thresholds q 
can be well approximated by stretched exponential functions 
fq(x) ~ e~ ax but with different values of the correlation ex- 
ponent 7. An ESS-like moment analysis confirms the existence 
of multiscaling rather than monoscaling. This result is consis- 
tent with previous analysis on individual Chinese stocks l29l 
and help us better understand the properties of volatility return 
intervals in the Chinese stock markets. 
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